Language-independent Automatic Syllable Segmentation Using Broad Phonetic Class Information

نویسندگان

  • Bogdan Ludusan
  • Emmanuel Dupoux
چکیده

We propose in this paper a language-independent method for syllable segmentation. The method is based on the Sonority Sequencing Principle, by which the sonority inside a syllable increases from its boundaries towards the syllabic nucleus. The sonority function employed was derived from the posterior probabilities of a broad phonetic class recognizer, trained with data coming from an open-source corpus of English stories. We tested our approach on English, Spanish and Catalan and compared the results obtained to those given by an energy-based system. The proposed method outperformed the energy-based system on all three languages, showing a good generalizability to the two unseen languages. We conclude with a discussion of the implications of this work for under-resourced languages.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Syllable Segmentation Using Broad Phonetic Class Information

We propose in this paper a language-independent method for syllable segmentation. The method is based on the Sonority Sequencing Principle, by which the sonority inside a syllable increases from its boundaries towards the syllabic nucleus. The sonority function employed was derived from the posterior probabilities of a broad phonetic class recognizer, trained with data coming from an open-sourc...

متن کامل

Syllable Specific Unit Selection Cost Function Using a Tone Modeling Technique for Automatic Phonetic Segmentation of Hindi Speech Using HMM

This paper presents a technique of improving tone correctness in speech synthesis of a tonal language based on an average-voice model trained with a corpus from nonprofessional speakers speech. Unit selection-based concatenative synthesis is one of the widely used speech synthesis approaches. This approach overcomes the limitations of other synthesis techniques such as articulatory synthesis an...

متن کامل

Automatic Labeling of Corpora for Speech

One of the bottlenecks in the development of text-to-speech synthesizers based on segment concatenation is the need for large, segmented and labeled corpora. Consequently, as manual segmentation and labeling is a tedious and time consuming task, there is a strong demand for automatic labeling systems which can label speech from many languages. Several systems have been proposed already, but the...

متن کامل

An Automatic Syllable Segmentation Method for Mandarin Speech

An automatic syllable segmentation method for mandarin speech is proposed. There are five features and the corresponding phonetic transcriptions used in the method. Firstly, the speech signals are pre-filtered. Secondly, the speech signal pre-filtered is segmented into 30 ms long segments and the five features of each segment are computed. Finally, syllable segmentation performs based on the ph...

متن کامل

Qualitative Evaluation and Error Analysis of Phonetic Segmentation

Speech segmentation is the process of splitting and identifying the boundaries between different units of speech, i.e., words, syllables, and phones. This paper focuses on the automatic phonetic segmentation of speech and the methods used for its evaluation. We explain the current methods used for the evaluation of speech segmentation and highlight the details that have not been sufficiently ad...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016